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Abstract 

Networks such as organizational network of a global company play an important 
role in a variety of knowledge management and information diffusion tasks. The nodes 
in these networks correspond to individuals who are self-interested. The topology of 
these networks often plays a crucial role in deciding the ease and speed with which 
certain tasks can be accomplished using these networks. Consequently, growing a 
stable network having a certain topology is of interest. Motivated by this, we study 
the following important problem: given a certain desired network topology, under 
what conditions would best response (link addition/deletion) strategies played by self- 
interested agents lead to formation of a pairwise stable network with only that topology. 
We study this interesting reverse engineering problem by proposing a natural model 
of recursive network formation. In this model, nodes enter the network sequentially 
and the utility of a node captures principal determinants of network formation, namely 
(1) benefits from immediate neighbors, (2) costs of maintaining links with immediate 
neighbors, (3) benefits from indirect neighbors, (4) bridging benefits, and (5) network 
entry fee. Based on this model, we analyze relevant network topologies such as star 
graph, complete graph, bipartite Turan graph, and multiple stars with interconnected 
centers, and derive a set of sufficient conditions under which these topologies emerge 
as pairwise stable networks. We also study the social welfare properties of the above 
topologies. 

Keywords: Social Networks, Network Formation, Game Theory, Pairwise Stability, Net- 
work Topology. 



1 Introduction 

A primary reason for networks such as social networks to be formed is that every person 
or node gets certain benefits from the network and these benefits take different forms in 
different types of networks. However, these benefits do not come for free. Every node in 
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the network has to pay a certain cost for maintaining hnks with its immediate neighbors or 
direct friends. This cost takes the form of time, money, or effort depending on the type of 
network. Owing to the tension between benefits and costs, self-interested or rational nodes 
think strategically while choosing their immediate neighbors. A stable network that forms 
out of this process will have a topological structure as dictated by the individual utilities 
and best response strategies of the nodes. 

Often, stakeholders such as a social network owner or a social planner, who work with the 
networks so formed, would like the network to have a certain desirable topology to facilitate 
efficient handling of knowledge management, information retrieval, and information diffusion 
tasks using the network. Typical examples of these tasks include enabling optimal commu- 
nication among nodes for maximum efficiency (knowledge management), extracting certain 
critical information from the nodes (information retrieval), broadcasting some information 
to the nodes (information diffusion), etc. If a particular topology is the most appropriate for 
the set of tasks to be handled, it would be useful to orchestrate network formation in a way 
that the required topology emerges as a stable network as a result of the network formation 
process. 

A network in the current context can be naturally represented as a graph consisting of 
self-interested agents called nodes and connections or friendships called links. Our analysis 
in this paper is based on the equilibrium notion of pairwise stability which takes into account 
bilateral deviations arising from mutual agreement of link creation between two nodes, that 
Nash equilibrium fails to capture [6]. Deletion is unilateral and a node can delete a link 
without consent from the other node. We recall the definition of pairwise stability from the 
literature. Let Uj{g) denote the utility that node j gets when the network formed is g. 

Definition 1 A network is said to be pairwise stable if it is a best response for a node 
not to delete any of its links and there is no incentive for any two unconnected nodes to 
create a link between them. So g is pairwise stable if 

(a) for each edge e = {i,j) G g, Ui{g\{e}) < Ui{g) and Uj{g\{e}) < Uj{g), and 

(b) for each edge e' = {i,j) ^ g, if Ui{g U {e'}) > Ui{g), then Uj{g U e') < Uj{g). 

We also recall another important property, namely, efficiency. 

Definition 2 A network is said to be efficient if the sum of the utilities of the nodes in 
the network is maximal. So g is efficient if it maximizes Ylji^N'^jid) > ^^^^ Z^'" networks 
g' on N, J2jeN^ji9) > J^jeN^jid')- 

We consider that all nodes are homogeneous and they have global knowledge of the 
network. 

1.1 Motivation 

One of the key problems addressed in the literature on social network formation is: given a 
set of self-interested nodes and a model of social network formation, which topologies would 
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be stable and which would be efficient. The trade-off between stability and efficiency is a 
key topic of interest and concern in the literature on network formation. 

In this paper, our focus is on the inverse problem, namely, given a certain desired net- 
work topology, under what conditions would best response (link addition/deletion) strategies 
played by self-interested agents lead to formation of a stable (and perhaps efficient) network 
with that topology. The problem becomes important because networks such as organizational 
network of a global company play an important role in a variety of knowledge management, 
information retrieval, and information diffusion tasks. The topology of these networks is 
one of the major factors that decides the ease and speed with which the above tasks can be 
accomplished. Often, a certain topology might serve the business interests of the network 
owner better. We explain this with some examples of relevant topologies shown in Figure [TJ 

Consider a network where there is a need to rapidly spread some crucial information. The 
information may be received by any of the nodes and it is important that all other nodes 
also get the information at the earliest. Also owing to the criticality of the information, it is 
desirable that there are redundant ways of communication, to take care of any link failures. 
In such complete network is ideal. 

Consider a different scenario where the information is crucial, however there needs to be 
a moderator to verify the authenticity of the information before spreading it to the other 
nodes in the network (for example, it could be a rumor). Here a star network is desirable 
as the center could act as a moderator and any information that originates in any part of 
the network has to flow through the moderator node before it can reach other nodes in the 
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Figure 1: Relevant topologies investigated in the paper 
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network. Virus inoculation is a related example where a star network is desirable since only 
the center needs to be inoculated in order to prevent spread of the virus to other parts of 
the network, thus reducing the cost of inoculation. 

Our next example concerns two communities or clusters of a society where some or all 
members of a community receive certain information simultaneously. The objective here is to 
forward the information to the other community. Moreover, it is desirable to not have intra- 
section links to save on resources. In this case, it is desirable to have a bipartite network. 
Moreover, if the information is critical and urgent, requiring redundancy, a complete bipartite 
network is desirable. A bipartite Turan network is a practical special case where both 
communities are nearly of equal sizes. 

Consider a generalization of the star network, where there are multiple centers and the 
leaf nodes are divided among the centers as evenly as possible. Such a network is desirable 
when the number of nodes is expected to be very large and there is a need for decentralization 
for efficiently controlling information in the network. We call such a network, /c-star network 
(see Figure [1]). 

It is clear that depending on the tasks for which the network is used, a certain topology 
might be better than the others. This provides the motivation for our work. 

1.2 Relevant Work 

The modeling of strategic formation in a general network setting was first studied by Jack- 
son and Wolinsky [S]. This widely cited model, however, does not capture bridging benefits. 
Jackson [S] reviews several models of network formation in the literature and highlights that 
pairwise stable networks may not exist in some settings. Aumann and Myerson [1] provide 
a sequential move game model where nodes are far-sighted, whereas Watts [H] considers a 
sequential move game model where nodes are myopic. In both of these approaches and in 
any sequential network formation model in general, the resulting network is based on the 
ordering in which links are altered and owing to random ordering, it is not clear which net- 
works emerge. Narayanam and Narahari [H] investigate the topologies of networks formed 
with a generic model of network formation based on a value function, Myerson value. Hum- 
mon [1] uses agent-based simulation approaches to explore the dynamics of network evolution 
based on the symmetric connections model [S] . Goyal and Vega-Redondo [5] propose a non- 
cooperative game model capturing bridging benefits wherein they introduce the concept of 
essential nodes, which is a part of our utility model. Doreian [2], given some conditions on 
a network, analytically arrives at specific networks that are pairwise stable. However, the 
complexity of analysis increases exponentially with the number of nodes and the analysis is 
limited to a network with only five nodes. Some gaps in this analysis are addressed by Xie 
and Cui [B], [IT]. 

The above models of social network formation assume that all nodes are present through- 
out the evolution of a network, which allows nodes to form links that may not be consistent 
with the desired network. For instance, if our desired network is a star graph, with certain 
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conditions on the network, a link between two nodes, of which one would play the role of 
the center, is desirable. However, with the same conditions, a link between other pairs is 
created with high probability, which is inconsistent with the desired star topology. Further- 
more, with all nodes present in an unorganized network, a random ordering over them in 
sequential network formation models adds to the complexity of analysis. However, in most 
social networks, not all nodes are present from beginning itself. A network starts building 
up from a few nodes and gradually grows to its capacity. Our model captures such type of 
network formation. 

There have been a few approaches earlier to design incentives for nodes so that the 
resulting network is efficient. Woodard and Parkes [15] use mechanism design to design 
incentives so that the outcome is an efficient network. Mutuswami and Winter [10] design a 
mechanism that ensures efficiency, budget balance, and equity. Though it is often assumed 
that the welfare of a network is based only on its efficiency, there are many situations where 
this may not be true. A particular network may not be efficient in itself, but it may be 
desirable for reasons external to the network, as explained in Section 11.11 

1.3 Contributions of the Paper 

In this paper, we study the inverse network formation problem, namely, under what condi- 
tions would a desired topology be obtained as a pairwise stable network when self-interested 
agents form a network by playing best response strategies. 

• First we propose a recursive model of network formation where nodes enter the network 
sequentially. The entry of a new node triggers an adjustment process in the network; 
this adjustment process continues until the network reaches a pairwise stable state. 
With this recursive model, we can guarantee that the network retains its topology in 
each of its stable states; also the analysis can be carried out independent of the current 
number of nodes in the network. The utility model we propose captures many key 
features: (a) benefits from immediate neighbors, (b) costs of maintaining links with 
immediate neighbors, (c) benefits from indirect neighbors, (d) bridging benefits, and 
(e) an entry fee for entering the network. 

• With the above proposed model, we study common and important network topologies, 
namely, star graph, complete graph, bipartite Turan graph, and /c-star graph, and 
derive sufficient conditions under which a pairwise stable network with the desired 
topology will result under the proposed model of network formation. 

• We also study the efficiency the above topologies. 

To the best of our knowledge, this is the first detailed effort in investigating the reverse 
engineering problem of obtaining a social network with desired topology. 
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Figure 2: Proposed model of network formation 



2 A Recursive Model of Network Formation 

We consider that a game is played amongst a set of self-interested nodes, where the objective 
of each node is to maximize its profits or benefits that it gets from the network. The network 
consists of n nodes at any given time, where n could vary from one to a certain maximum 
number of nodes. The game starts with one node, whose only strategy is to remain in its 
current state. The strategy of the second node is to either (a) not enter the network or 
(b) form a link with the first node. We make an intuitive assumption that in order to be 
a part of the network, the second node has to propose a link with the first node and not 
vice versa. Also, for successful link creation, utility of the first node should not decrease. 
The network formed with these two nodes evolves to a pairwise stable network, which in 
the case of two nodes, does not result in any new network. After the network reaches a 
pairwise stable state, the third node considers entering the network. This process continues, 
which thus results in the formation of a pairwise stable network of n nodes. We note that, 
in the process described above, no node in the network of — 1 nodes can create a link 
with the newly entering n^^ node until the latter successfully forms a link with one of the 
existing nodes in the network. After the new node enters the network successfully, nodes 
who get to make their move are chosen at random at all time and the network evolves until it 
reaches a pairwise stable network consisting of n nodes. Following this, a new {n + 1)*'^ node 
considers entering the network and the process goes on recursively. The assumption that a 
node considers entering the network only when it is pairwise stable might seem artificial in 
general social networks, but can be justified in organizational networks where entry of nodes 
can be controlled by a network administrator. The model is depicted in Figure [2J 

Each node has a set of strategies at any given time and it chooses its myopic best response 
strategy which maximizes its immediate utility. A strategy can be of one of the three types, 
namely (a) creating a link with a node that is not its immediate neighbor, (b) deleting a link 
with an immediate neighbor, or (c) maintaining status quo. Note that a node will compute 
whether a link it proposes decreases utility of the other node, because if it does, it is not 
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its myopic best response as the link will not be accepted by the latter. Moreover, consistent 
with the notion of pairwise stability, if a node gets to make a move and altering a link does 
not strictly increase its utility, then it prefers not to alter it. 

The proposed utility model, that we use for the purpose of our analysis, is described 
below. 

2.1 Utility Model 

As nodes have global knowledge of existing nodes in the network while making their decisions, 
for instance, creating a link with a faraway node, we propose a utility model that captures 
the global view of both indirect and bridging benefits. Our model takes the idea of essential 
nodes from the model proposed by Goyal and Vega-Redondo [3]. A node j is said to be 
essential for y and z if j lies on every path that joins y and z in the network. Whenever 
nodes y and z are directly connected, they get the entire benefits arising from the direct 
link. On the other hand, when they are indirectly connected with the help of other nodes, 
of which at least one is essential, y and z lose some fraction of the indirect benefits in the 
form of intermediation rents paid to the essential nodes without whom the communication 
is infeasible. Moreover, for simplicity of analysis, we assume that nodes that lie on path(s) 
connecting y and z, but are not essential, do not get any share of the intermediation rents. 
So, when y and z are indirectly connected with the help of other nodes of which none is 
essential, they get the entire indirect benefits arising from their connection with each other. 
In order to avoid discrete constraints on rents, such as summation of the fractions paid to 
be less than one, we assume that irrespective of the number of essential nodes connecting y 
and z, they lose the same fraction. 

We now describe the determinants of network formation that our model captures, and 
thus obtain expression for the utility function. Table [1] enlists the notation we use in the 
rest of the paper. 

2.1.1 Network Entry Fee 

Since nodes enter a network one by one, we introduce a notion of network entry fee. This fee 
corresponds to some cost a node has to bear in order to be a part of the network. It is clear 
that, if a newly entering node wants its first connection to be with an existing node which 
is of high importance or degree, then it has to spend more time or effort. So the entry fee 
that the former pays is assumed to be an increasing function of the degree of the latter, say 
d^. For simplicity of analysis, we assume the fee to be directly proportional to d^ and call 
the proportionality constant, network entry factor cq. 
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AT 
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set of nodes present in the network 


dj 
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distance between nodes j and w 


7 


fraction of indirect benefits paid to the corresponding set of essential nodes 


E{3,w) 


set of nodes essential to connect j and w 


e(j, w) 


\E{3,w)\ 


Co 


network entry factor (see Network Entry Fee) 


T(j) 


target node to which node j connects to enter the network 


I{i=NE} 


1 when j is a newly entering node about to create its first link, else it is 



Table 1: Notation for the proposed utility model 

2.1.2 Direct Benefits 

These benefits are obtained from immediate neighbors in a network. For a node j, these 
benefits equal bi times dj. 

2.1.3 Link Costs 

These costs are the amount of resources like time, money, and effort a node has to spend 
in order to maintain links with its immediate neighbors. For a node j, these costs equal c 
times dj. 

2.1.4 Indirect Benefits 

These benefits are obtained from indirect neighbors or indirect friends and these decay with 
distance, that is < bi. In the absence of rents, the total indirect benefits that a node j 
gets is J2n^^N, i{j,w)>ihj,^v)- 

2.1.5 Intermediation Rents 

Nodes pay a fraction 7 (0 < 7 < 1) of the indirect benefits, in the form of additional favors or 
monetary transfers to the corresponding set of essential nodes, if any, and the loss incurred 
by a node j due to these rents is E«,e7v, E{j,w)^<plHj,y^)- 
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2.1.6 Bridging Benefits 

In our model, a node gets bridging benefits for enabling communication between pairs of 
nodes which are otherwise disconnected. Two nodes pay a fraction 7 of the indirect benefits 
to the set of essential nodes connecting them, which is assumed to be equally divided among 
the essential nodes connecting that pair. Let a node j be one of the essential nodes connecting 
two nodes y and z. Both y and z benefit each and so the connection produces a total 
benefit of 2hi(^y^z)- Each node from the set E{y,z) gets a fraction ^^J^^ , the actual benefits 

being ^^^^^^ '^^i{y,z)- So the bridging benefits obtained by a node j from the entire network 

is Yl,y,z&N, ji^E{y,z) (i(^) 2^K3/.z)- 

2.1.7 Utility Function 

For a node j, the utility function is a function of the network, that is Uj : g ^ M. We drop 
the notation g from the following equation for readability. Summing up all the determinants 
of network formation that our model captures, the utility function for node j is given by 



w&N w£N y,zeN ^ 

lij,w)>l EU,w)j^<l, j&E{y,z) 

2.2 Dynamics of Network Formation 

The proposed model of network formation is based on a sequential move game and hence 
can be represented as an extensive form game tree. A snapshot of one such game tree is 
shown in Figure O 

2.2.1 Game Tree 

As the entry of each node in the network results in one game tree, the network formation 
process results in a series of game trees. Each node of a game tree (not to be confused with 
a node of the network) represents a network state, while each branch represents a possible 
transition from a network state, owing to decision made by a node. So, the root of a game 
tree represents the network state in which a new node is considering to enter the network. 

A general way to find an equilibrium in an extensive form game is to use backward 
induction [12]. Our game is a special case of such a game where the players have bounded 
rationality, that is their best response strategies are myopic. So instead of the regular 
backward induction approach or the bottom-up approach, we take a top-down approach, 
which results in derivation of the same set of conditions under which a network topology 
gets formed. An improving path is a sequence of networks, where each transition is obtained 
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by either two nodes choosing to add a hnk or one node choosing to delete a hnk. Thus, a 
pairwise stable network is one from which there is no improving path leaving it [7]. The notion 
of improving paths is a myopic one, and agents make their decisions of altering links without 
considering how their actions affect the decisions of other nodes and hence the evolution 
of network. Though this process of improving paths exhibits bounded rationality, it is a 
natural variation on best response dynamics and has some experimental justifications |13j . 
So, deriving using the top-down approach leads to an intuitive understanding of the dynamics 
of network formation using the notion of improving paths. 

2.2.2 Notion of Types 

As the order in which nodes take decision, is random, in a general game, the number of 
branches arising from each state in the game tree depends on the number of possible con- 
nections a node can be involved in (or number of possible connections with respect to a 
node). We say two nodes, say A and C, of a graph g are of the same type if there exists 
an automorphism / : V{g) — V{g) such that f{A) = C, where V{g) is the vertex set of g. 
The implication of nodes being of the same type is that, for any automorphism /, if the best 
response strategy of node A is to alter its link with node D, then the best response strategy 
of f{A) is to alter its link with f{D). So at any point in time, it is sufficient to consider the 
best response strategies of one node of each type. We say two connections with respect to 
a node 5, say with nodes A and C, are of the same type if there exists an automorphism / 
such that f{A) = C and f{B) = B. The implication of connections being of the same type 
with respect to a node is that, the node is indifferent between the connections, irrespective 
of the utility model. Different types of connections with respect to a node form different 
branches in the game tree. 

For example, in Figure [T](e) , nodes G and H are of the same type. Also, the two possible 
connections MG and MH with respect to node M, are of the same type. But the possible 
connections EG and EH with respect to node E, are not of the same type. So, these two 
strategies of node E, namely, connecting with nodes G and if, form different branches in 
the game tree, implying that the utilities arising from these two types of connections are 
not necessarily equal. In a network formation game with homogeneous nodes, the number 
of branches depends on the number of different types of possible connections with respect 
to a node, at that particular instant. Furthermore, as social networks inherently have low 
diameter [9] and we are primarily interested in the formation of special topologies in a 
recursive manner (nodes are already organized according to the topology and the objective 
is to extend the topology to one more node, so the existing nodes play the same role as 
before, and most or all of the existing links do not change), the number of different types 
of nodes, as well as the number of different types of possible connections with respect to a 
node, at any instant, is a small constant, thus simplifying the analysis. 
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2.2.3 Directing the Dynamics 



The procedure for deriving sufficient conditions for the formation of a given topology is 
similar to mathematical induction. Consider a base case network with very few nodes (two 
in our analysis). We derive conditions so that the network formed with these few nodes has 
the desired topology. Then using induction, we assume that a network with n — 1 nodes 
has the desired topology, and derive conditions so that, the network with n nodes, also has 
that topology. Without loss of generality, we explain this procedure with the example of star 
topology, referring to Figure [31 We derive conditions for base case in the proof of Theorem [1] 
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Figure 3: Directing the Dynamics of Star Formation 
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Now assuming that the network formed with n — 1 nodes is a star, we derive conditions so 
that the network of n nodes is also a star. 

In Figure [3l node A is the newly entering node and the network is currently in state 
where a star with n — 1 nodes is already formed. Note that in state 0, with respect to node 
A, there are two types of possible connections: (a) with the center and (b) with a leaf node. 
In states 1, 3, 4 and 5, there are two types of nodes, and two types of possible connections 
with respect to a leaf node and one with respect to the center. It will be seen that, the 
network is directed to not enter state 2, so even though there are four different types of 
nodes, it is not a matter of concern. Firstly, we want node A to connect to the center and 
enter state 1 by choosing the improving path that transits the network from state to state 
1 in Figure O So utility of node A in state 1 should be greater than that in state 0, that 
is Uyi(l) > ua{0), where u is any utility function. Equation ([T]) being one such function. 
Similarly, for node B to accept the link from node A, B^s utility should not decrease, that 
is ub{1) > ub{0). Also we do not want node A to connect to any of the leaf nodes, that 
is, we do not want the network to enter state 2. Note that there might be an improving 
path from state 2 that eventually results in a star, but as we are concerned with sufficient 
conditions, we discard state 2 in order to shorten the analysis. One way to ensure that the 
network does not enter state 2, irrespective of whether it lies on an improving path, is by 
making it less favorable for node A than the desired state 1. That is, starting from state 0, 
there exists an alternative improving path which gives node A better utility than entering 
state 2. We say that, for node A, the strategy of creating a link with any of the leaf nodes is 
dominated by the strategy of creating link with the center. For state 2 to be dominated by 
state 1, ma(2) < Another way to ensure the same is by having a condition for a leaf 

node such that, accepting a link from node A decreases its utility and so a leaf node does 
not accept the link, thus forcing node A to connect to the center, that is Uj{2) < Uj{0) for 
j = C, D, E, F. So the network enters state 1, which is our desired state. 

In order to ensure pairwise stability of our desired state, there should be no improving 
paths leading out of it, for which we need to take care of two cases. First, when node B gets 
to make its move, it can either (a) break any of its links and enter state 5 or (b) remain in 
the same state 1. As we do not want the network to enter state 5, node B's utility should 
be at least as good remaining in state 1 as entering 5, that is ^^(1) > ub{5). Second, 
when any of the leaf nodes is chosen at random, it can either (a) create a link with some 
other leaf node and enter state 3 or (b) delete its link with the center and enter state 4, or 
(c) remain in the same state 1. As we want the network to stay in state 1, the additional 
conditions respectively are Uj{l) > Uj{3) and Uj{l) > Mj(4) for j = A,C, D, E, F. Thus we 
direct the dynamics of network formation along a desired improving path by imposing a set 
of conditions ensuring that the resulting network is in the desired state or has the desired 
topology. In the following section, we analyze some common network topologies using our 
model of recursive and sequential network formation. 
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3 Sufficient Conditions for Formation of Relevant Topolo- 
gies 

In this section, we analyze the dynamics of formation of several relevant network topologies, 
namely star, complete graph, bipartite Turan graph, 2-star, and fc-star, and derive sufficient 
conditions for their formation. Note that the conditions derived for any particular network 
topology are sufficient under the given setting, and there may exist alternative conditions 
that result in the same topology. We use Equation ([1]) for mathematically deriving the 
conditions. 

Theorem 1 For a given network, if bi — 62 + 7^2 < c < 61 and Cq < (1 — 7) (^2 — ^3); the 
resulting topology is a star. 

Proof. Refer to Figure [3] throughout the proof. For the base case of n = 2, the requirement 
for the second node to connect to the first is that its utility should become strictly positive. 
Also as the first node has degree 0, there is no entry fee. 

< 61 - c c < 61 (2) 

Now, consider a star consisting of n — 1 nodes. Let the newly entering n^^ node get to make 
a decision of whether to enter the network. For n > 3, if the entering node connects to the 
center, it gets indirect benefits of 62 each from n — 2 nodes. But as the center is essential 
in order to connect the newly entering node with the other leaf nodes, the new node has to 
pay 7 fraction of these benefits to the center. Also, it has to pay an entry fee of (n — 2)co as 
the degree of center is n — 2. So in Figure [3l ua{0) < ua{1) gives 

< 61 - c + (n - 2) (1 - 7) 62 - (ri - 2)co 

^ c < 61 + (n - 2) ((1 - 7) 62 - Co) 
As it needs to be true for all n > 3, we set the condition to 

c < min \bi + {n-2) ((1 - 7) 62 - co) | 

n>3 I. J 

^ c < 61 + (1 - 7) 62 - Co (3) 

The last step is obtained so that the condition for link cost is independent of the upper limit 
on the number of nodes, by enforcing 

Co < (1 - 7) h (4) 

which enables us to substitute n = 3 and the condition holds for all n > 3. 

For the center to accept a link from the newly entering node, we need to have ub{0) < ub{1)- 
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For n = 2, the requirement for the first node to accept hnk from the second node is < 61 — c 
which is satisfied by Inequahty For n = 3, as the center is essential for connecting the 
other two nodes separated by distance two, it gets 7 fraction of 62 from both the nodes. So 
it gets bridging benefits of 2762- 

bi-c< 2{bi - c) + 27&2 
^ c <bi + 2762 

This condition is satisfied by Inequahty ([2]). For n > 4, prior to entry of the new node, 
the center alone connected ~ ^) pairs of nodes at distance two from each other, while after 
connecting with the new node, the center is the sole connection for ~ ^) such pairs. So the 
required condition: 

_ 2) {h, -c)+^(^~'^yh<{n-l){h-c)+^(^~ 262 

This condition is satisfied by Inequality (|2]) for all n > 4. 

For the newly entering node to prefer the center over a leaf node as its first connection (not 
applicable for n = 2 and 3), we need ua{X) > ua{2). 

61 - c + (n - 2) (1 - 7) 62 - (ri - 2)co > 61 - c + (1 - 7) &2 + (n - 3) (1 - 7) ^3 - Cq 

^ Co < (1 - 7) (&2 - h) (5) 

Alternatively, the newly entering node may want to connect to the leaf node, but the leaf 
node's utility decreases. In that case, the alternative condition can be Uj{2) < Uj{0) for 
j = C, D, E, F. Note that this leaf node gets bridging benefits of 2762 for being essential 
in connecting the new node with the center. Also, as it is one of the two essential nodes in 
connecting the new node with the other n — 3 leaf nodes (the other being the center), it gets 
bridging benefits of (n — 3){^)2b-s = {n — 3)763. 

61 - c + (n - 3) (1 - 7) 62 > 2(61 - c) + (n - 3) (1 - 7) 62 + 2762 + {n - 3)763 

which gives c > 61 + 2762 + {n — 3)763. But this is inconsistent with the condition in 
Inequality ([2]). So in order to ensure that the newly entering node connects to the center 
and not to any of the leaf nodes, we use Inequality ([5]) only. 

Now that a star of n nodes is formed, we ensure its pairwise stability by deriving con- 
ditions for the same. Firstly, we ensure that the center does not delete any of its links. So 
we need mb(1) > mb(5). Note that from the center's point of view, state 5 is same as state 
and as we have seen earlier that mb(0) < mb(1), the required condition ms(4) < mb(1) is 
already ensured. 

Next, no two leaf nodes should form a link between them. So we should ensure that, not 
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creating a link between them is at least as good for them as creating, that is Uj{l) > Uj{3) 
for any leaf node j. This condition is applicable for n > 3. 

61 - c + (n - 2) (1 - 7) 62 > 2(61 - c) + (n - 3) (1 - 7) 62 

^ c > 61 - 62 + 7^2 (6) 

For a leaf node to not delete its link with the center, we need Uj{l) > Uj{4) for any leaf node 
j. For n > 2, we have 

61 - c + (n - 2) (1 - 7) 62 > 

^ c < 61 + (n - 2) (1 - 7) 62 

which is a weaker condition than Inequality ([2]) for n > 2. 

Note that Inequalities ([2]) and put together are stronger than Inequahties and (jl]) 
combined. We get the required result using Inequalities (EJ, ([5]) and ([6]). ■ 

We provide proof of Theorem [3] in the Appendix. Proofs of Theorems |2] and H] use similar 
technique. 

Theorem 2 For a network, z/ c < 61 — 62 (in-d cq < (1 — 7) 62 , ^/^e resulting topology is a 
complete graph. 

Theorem 3 For a network with 7 < ^E^? ^/ &i — ^2 + 7 (3&2 ~ ^3) < c < 61 — 63 anc? 
(1 — 7) (62 — ^3) < Co < (1 — 7) 62 ; i^^e resulting topology is a bipartite Turdn graph. 

Theorem 4 Let a be the upper bound on the number of nodes that can enter the network 
and A = [| — 1] (262 — b^). Then, if (1 — 7) (62 — ^3) < Cq < (1 — 7) (62 — ^4) o,nd either 

^ < 7 < rrr fei - &2 + 7^2 + 7A < c < bi, or 

(11) 7 < min ^} anfi 61-63 + 7 (^2 + 63) < c < 61, 

i/ie resulting topology is a 2-star. 

For arbitrarily large upper bound on the number of nodes that can enter the network, 
the following corollary is immediate from the above theorem. 

Corollary 1 For a network with 7 = 0, if bi — b-^ < c < bi and 62 — 63 < cq < 62 — 64, the 
resulting topology is a 2-star. 

3.1 Base Graph 

The sufficient conditions for the formation of topologies analyzed so far, are obtained, starting 
from the base graph or the starting graph consisting of a single node (corresponding to the 
base case of formation of a network with n = 2). However, in case of some topologies, under 
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a given utility model, the conditions required for its formation on discretely small number 
of nodes, are inconsistent with that required on arbitrarily large number of nodes. Lemma [2] 
shows that k-stai is one such topology. We provide its proof in the Appendix. 

Lemma 1 Under the proposed utility model, for the entire family of k- star networks (k > 3) 
to be pairwise stable, it is necessary that 7 = and c = 6i — 63. 

Proof. We consider two scenarios sufficient to prove this. 

I) No center should delete its link with any other center: Here, only one case is enough to 
be considered, that is, when each center has just one leaf node since in all other cases, the 
benefits obtained by each center from the connection with other centers is at least as much. 
For k = 3, 

3(61 - c) + 2(1 - 7)62 + 7(1)(2)262 + |(l)(2)fe3 > 2(61 - c) + (1 - 7)^2 + (1 - 7)^2 

+ (1 - 7)63 + 7(l)(l)2fe2 + J(l)(l)263 



2 

. , 2, 

2v .V / - 3 



+J(1)(1)263 + J(1)(1)264 



which gives 



For k > 4, 



c < 61 - 63 + 7(2^2 + &3) - y &4 (7) 



{k-l + l)(6i -c) + {k-l){l- 7)62 + 7(1)(^ - 1)2&2 + |(l)(fc - 1)263 
>{k-2 + l){bi-c) + {k- 2)(1 - 7)62 + 62 + (1 - 7)&3 + l{l){k - 2)262 

+l{l)ik - 2)263 + 7(1)(1)263 + |(1)(1)264 

which gives 

c < 61 - 63 + 7(62 - 64) (8) 

II) Leaf nodes of different centers should not form a link with each other: Consider a leaf 
node. Let rrii be the number of leaf nodes connected to the center to which the leaf node 
under consideration is connected. For k > 3, 

2(61 - c) + K - 1)(1 - 7)62 + (1 - 7)63^^, - 1) + (A; - 1)62 
< 61 - c + (mi - 1)(1 - 7)62 + (1 - 7)^3 5^ + ik- 1)(1 - 7)62 

which gives 

c > 61 - 63 + l{{k - 1)62 + 63) (9) 
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The only way to satisfy Inequalities and (jH]) simultaneously is by setting 

7 = (10) 

and 

c = h-h (11) 



Lemma 2 Under the proposed network formation and utility models, starting with a network 
consisting of a single node, k-star network (k > 3) cannot be formed. 

Proof. From Lemma [H the conditions necessary for the family of A;-star networks to be 
pairwise stable are 7 = and c = bi — b^, which are sufficient conditions for the formation 
of 2-star network according to Corollary [1] ■ 

The following lemma shows a result for a more general class of network formation models 
with some restriction on the desired improving path. 

Lemma 3 Under the proposed utility model and the class of evolution models that allow 
altering at most one link at a time based on myopic best response strategies, starting with a 
network consisting of a single node, one cannot find sufficient conditions for the formation of 
a k-star network (k > 3) by directing the evolution of a network along a shortest improving 
path (where each network is obtained from the preceding network by adding a link). 

Proof. It is clear from Lemma [T] that 7 = and c = 6i — 63 are necessary for the family of 
/c-star networks {k > 3) to be pairwise stable. We consider three cases how a fc-star network 
builds. 

Case (i) The first k nodes form a complete network amongst themselves first, following 
which, other nodes enter, who then play the role of leaf nodes: Lemma |2] shows that a A;-star 
network cannot be formed if the topology is to be maintained in each pairwise stable state. 
So there exist no conditions under which a fc-star network will be formed. In the rest of the 
cases, the centers are connected to some leaf nodes and there will eventually come a time 
when two centers at distance two from each other, have to form a mutual link. 

Case (ii) At least one of these two centers have no leaf node: It can be seen that, with the 
necessary condition 7 = 0, if c > 61 — 62, no node would want to create a link with another 
node which is at distance two or more from it, if all of latter 's direct neighbors, if any, are 
direct neighbors of the former. So creating a link with the center (at distance two from it) 
having no leaf is not beneficial for the other center as c = 61 — 63. 

Case (iii) Both centers have at least one leaf node each: It can be seen that if c < 61 — 64, 
nodes who are at distance four or more benefit by creating a link between them. As the leaf 
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nodes of these two centers are at distance four from each other, it is desirable for them to 
create hnk between them as c = 61 — 63. ■ 



A possible and reasonable solution to overcome this problem is to analyze the network 
formation process, starting from some other base graph. The base graph can be obtained by 
some other method, one of which could be providing additional incentives to the nodes of 
the base graph. For instance, for analyzing the formation of /c-star, the base graph is taken 
to be the complete network on the k centers, with the centers connecting to one leaf node 
each. As the base graph consists of 2k nodes, the induction starts with the base case for 
formation of /c-star network with n = 2k + 1. Theorem [5] gives the sufficient conditions for 
the formation of /c-star network provided the network starts building itself from this base 
graph. Its proof uses technique similar to that of Theorem [31 

Theorem 5 For a network starting with the base graph for k-star (k > 3), and 7 = 0, if 
c = 61 — 63 and 62 — &3 < Co < 62 — ^4? the resulting topology is a k-star. 

Proof. It is clear from Lemma [T] that under the proposed utility model, for the family of 
/c-star networks {k > 3) to be pairwise stable, it is necessary that 7 = 0, that is, we need 
7 = in order to arrive at all possible /c-star networks for a given k, and hence forms the 
necessary part of sufficient conditions for the formation of a fc-star network. Hence, for the 
rest of this proof, we will assume 7 = 0. 

Without loss of generality, assume some indexing over the k centers from 1 to k such 
that any center connected to more leaf nodes has a higher index than any center connected 
to less leaf nodes. Let Ci be the center with index i and rrii be the number of leaf nodes it is 
connected to. Also we start with a base graph in which every center is connected to one leaf 
node and the number of leaf nodes connected to each center increases as the process goes 
on. So we have, 1 < mi < m2 < • ■ ■ < nik- 

For the newly entering node to propose entering the network: Our objective is 
to ensure that the newly entering node connects to a center with the least number of leaf 
nodes in order to maintain balance over the number of leaf nodes connected to the centers. 
Without loss of generality, assume that we want the newly entering node to connect to Ci. 
The utility of the newly entering node should be positive after doing so. 

k 

bi - c + (mi + - 1) (62 - Co) + &3 ^ mj > 

i=2 

Since the minimum value of mj is 1 for any i, the above condition is true if 

c<bi + k{b2- Co) + {k- 1)63 
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This is true if 

c < bi (satisfied by Equation (fTTl) ) 

and 

Co < 62 (12) 

The newly entering node should connect to a center with the least number of 
leaf nodes, whenever applicable: This case does not arise when all centers have the 
same number of leaf nodes. Moreover, the way we direct the evolution of the network, the 
number of leaf nodes connected to any two centers differs by at most one. Without loss of 
generality, assume that we want the newly entering node to connect to Ci. Consider a center 
Cp such that rrip = nii + 1. So the newly entering node should prefer connecting to Ci over 
connecting to Cp. 

k 

hi - c + {nil + k- l) (62 - Co) + 63 ^ > 61 - c + {rrip + A; - 1) (62 - Cq) + 63 mj 

i=2 l<i<k 

As nip = mi + 1, we have 

Co > &2 - h (13) 

For a center with the least number of leaf nodes to accept the link from the 
newly entering node: It can be easily seen that this is ensured by Equation ffTTj) . 



The newly entering node should not connect to any leaf node: It can be easily seen 
that owing to benefits degrading with distance, for the newly entering node, connecting to 
any leaf node which is connected to a center with the most number of leaf nodes strictly 
dominates connecting to any other leaf node, whenever applicable. So it is sufficient to 
ensure that the newly entering node does not connect to any leaf node which is connected 
to a center with the most number of leaf nodes. This can be done by ensuring that for 
the newly entering node, connecting to a center with the least number of leaf nodes strictly 
dominates connecting to any leaf node which is connected to a center with the most number 
of leaf nodes. 

k k-l 

bi- c + {nil + A; - 1) (&2 - cq) + 63 ^ > 61 - c + 62 - cq + (m^ + A; - 2)63 + 64 ^ mi 

i=2 i=l 

We need to consider two cases (i) = mi + 1 and (ii) m^ = mi 
Case (i) m^ = mi + 1: Substituting this value of m^ gives 

fc-i 

(mi + k - l){b2 - Co) + (63 - b^j^nii + mi(63 - 64) + 63 > ^2 - co + (mi + k - 1)63 

i=2 
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As the minimum value of ^^=2^ rrii is k 
by k — 2. Further simphfication gives 



2, the above remains true if we replace 



k-l 
1=2 



(mi + k 



2) (62 



bi - Co) > 



Since mi + — 2 > is positive, we must have 



Co < &2 



(14) 



Case (ii) m^ = mi: It can be similarly shown that Equation fll4p is the sufficient condition. 

Now that the newly entering node enters in a way such that /c-star network is formed, we 
have to ensure that no further modifications of links occur so that the network formed thus 
is pairwise stable. 

For centers and the corresponding leaf nodes to not delete the link between 
them: It can be easily seen that c < 61, a weaker condition than Equation ffTTl) . is a suffi- 
cient condition to ensure this. 

No center should delete its link with any other center: This is ensured by Inequali- 
ties ([7]) and ([H]) as explained in Lemma [TJ 

Leaf nodes of a center should not form a link with each other: The net benefit that 
a leaf node gets by forming such a link should be non-positive. 



^ c>bi-b2 

which is satisfied by Equation ([TTj) . 

Leaf nodes of different centers should not form a link with each other: This is 
ensured by Inequality ([9]) as explained in Lemma [H 

Link between a center and leaf node of other centers should not be created: Let 

Ci be the center under consideration and the leaf node under consideration be connected 
to Cj {j 7^ i). There are two ways to ensure this. First is to ensure that a center neither 
proposes nor accepts a link with a leaf node of other centers. This mathematically is 



61 - c - 62 < 



(A; - 1 + mi){bi - c) + 52^"^^ > {k - 1 + rrii + l){bi - c) + b2C^mk - 1) 




c> 61 - 62 
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An alternative to this condition is to ensure that a leaf node neither proposes nor accepts 
a link with a center to which it is not connected, but since this condition is already satisfied 
by Equation (ITT]) , this alternative need not be considered. 

Equations ( 1T0|) . (ITT]) . (|T2|) . (|T3|) and (|T4|) give the required sufficient conditions for the 
formation of a k-star network. ■ 



3.2 Intuition Behind the Results 

The network entry fee has an impact on the resulting topology as seen from the above 
theorems. For instance, in Theorems [1] and [3l the intervals spanned by the values of c and 
7 may intersect, but the values of network entry factor cq span mutually exclusive intervals 
separated at (1 — 7)(&2 — ^s)- In case of star, cq is low and a newly entering node can afford 
to connect to the center, which in general, has very high degree. In case of bipartite Turan 
graph, it is important to ensure that the sizes of the two partitions are as equal as possible. 
As Co is high, a newly entering node connects to a node with a lower degree, whenever 
applicable, that is, to a node that belongs to the partition with more number of nodes, and 
hence the newly entering node potentially becomes a part of the partition with fewer number 
of nodes, thus maintaining a balance between sizes of the two partitions. In case of fc-star, 
the objective is to ensure that a newly entering node connects to a node with moderate 
degree, that is, the network entry factor is not so high that a newly entering node prefers 
connecting to a leaf node and also not so low that it prefers connecting to a center with 
the highest degree. This intuition is clearly reffected in Theorems S] and [5] where Cq takes 
intermediate values. In general, a high value of network entry factor cq lays the foundation 
for formation of a regular graph. 

It is clear that a complete network is formed when the costs of maintaining links is 
extremely low, as reffected in Theorem |2j The remaining topologies are formed in the 
intermediate ranges of c. From Theorems |3l S] and |5l it can be seen that the feasibility of 
a network being formed depends on the values of 7 also, which arises owing to contrasting 
densities of connections in a network. For instance, in a bipartite Turan network, nodes 
from different partitions are densely connected with each other, while that from the same 
partition are not connected at all. Similarly, in a /c-star network, there is an extreme contrast 
in the densities of connections (dense amongst centers and sparse for leaf nodes). 

3.3 Connection to Efficiency 

In this section, we analyze the efficiency of the concerned networks. The conditions are 
sufficient and not necessary, and so there may exist other sets of conditions that result in 
a given topology. We analyze the efficiency based on the assumption that the networks are 
formed using the derived conditions. 
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Prom Equation ([T]), the intermediation rents are transferable among the nodes, and so 
do not affect the efficiency of a network. Furthermore, the network entry fee is paid by any 
node at most once, and so does not account for efficiency in the long run. So the expression 
for efficiency of a network is 



Lemma H] follows from the analysis of efficient networks by Narayanam and Narahari [TT] . 

The null graph in the proposed model of recursive network formation corresponds to a 
single node to which no other node prefers to connect, and so the network does not grow. 

Theorem 6 Based on the derived sufficient conditions, null graph, star graph, and complete 
graph are efficient. 

Proof. It is easy to see that irrespective of the value of Cq, if c > 61, no node, external to the 
network, connects to the only node in the network and hence, does not enter the network. 
Such a network is trivially efficient as in the range c > 61, it is a star of one node and also 
a null graph. It is also clear that the star topology and the complete graph are efficient as 
the conditions on c from Theorems [T] and |2] form a subset of the range of c in which these 
topologies are respectively efficient. ■ 

Theorems [7] and M give bounds on the efficiency of bipartite Turan network and A;-star 
network, respectively. We provide their proofs in the Appendix. 

Theorem 7 Based on the derived sufficient conditions, for fi sufficiently large, efficiency 
of bipartite Turan network is half of that of the efficient network in the worst case and the 
network is close to being efficient in the best case. 

Proof. Here, we make a reasonable assumption that the number of nodes in the network is 
large. Hence we can assume that n is even without loss of accuracy. The sum of utilities of 
nodes in a bipartite Turan network with even number of nodes is 




\ lij,w)>l J 



Lemma 4 Let (i be the number of nodes in the network. 

(i) If c < bi — 62; then complete graph is uniquely efficient. 

(a) If bi — b2 < c < bi + (^^) then star is the unique efficient topology. 

(Hi) If bi + (^^) &2; then null graph is uniquely efficient. 
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From Lemma HI star network is efficient in the range of c derived in Theorem [31 So, to 
get the efficiency of the bipartite Turan network relative to the star network, we divide the 
above sum by the sum of utihties of nodes in a star network, which is 

2{f^-l){b,-c)+(^^~^yb, (15) 

Using the assumption that fi is large and the fact from the sufficient conditions that 62 
is comparable to bi — c, it can be shown that the efficiency relative to the star network, 
approximately is 

1 bi-c 
2^ 2b2 

As the range of c in Theorem [3] depends on the value of 7, the values of c are bounded by 
bi — 62 and bi — 63. So the efficiency is bounded by 1 and + of that of the star 
network, which can take a minimum value of | of that of star network when 63 << 62- B 



Theorem 8 Based on the derived sufficient conditions, for fi sufficiently large, efficiency of 
k-star network is ^ of that of the efficient network in the worst case and the network is close 
to being efficient in the best case. 

Proof. We make the reasonable assumption of n being large. In particular, n » k (not 
necessarily >> fc^). Hence we can assume that fj, is divisible by k without loss of accuracy. 
The sum of utilities of nodes in such a k-stai network is 

2 (^2) (^1 - c) + 2(/i - fc) (61 -c) + 2k(^l- {fi - k)b2 

1 - 7 ) - ^)^3 



From Lemma HI star network is efficient in the range of c derived in Theorems H] and [51 So, 
to get the efficiency of the fc-star network relative to the star network, we divide the above 
sum by Expression (1151) . Using the assumption that /i is large and the fact from the sufficient 
conditions that 62 and 63 are comparable to bi — c, it can be shown that the efficiency relative 
to the star network, approximately is 

V k) b2 

As 63 is bounded by and 62, the efficiency of /c-star is bounded by \ and 1 of that of the 
star network. ■ 
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4 Discussion and Future work 



We proposed a model of recursive network formation where nodes enter a network sequen- 
tially, thus triggering evolution of the network each time a new node enters. Though we have 
assumed a sequential move game model with myopic nodes and pairwise stability as the so- 
lution concept, the model, as depicted in Figure |21 is independent of the model of network 
evolution, the solution concept used for equilibrium state, and also the utility model. The 
recursive nature of our model enabled us to directly analyze the network formation game 
using an elegant induction based technique. We derived sufficient conditions by directing 
the dynamics of network formation along a desired improving path in the sequential move 
game tree. 

4.1 Future work 

This work proposed a way to derive conditions on cost for maintaining link with an immediate 
neighbor and also for entering a network, under which a desired network topology is obtained. 
Going a step further, it would be interesting to design incentives such that agents in a network 
comply with these conditions. The proposed model of network formation can be extended 
to other utility models to investigate the formation of interesting topologies under them. 
Our analysis ensures that irrespective of the chosen node at any point in time, the network 
evolution is directed as desired. A possible solution for simplifying the analysis for more 
involved topologies is to carry out probabilistic analysis for deriving conditions so that a 
network has the desired topology with high probability. Another interesting direction, from 
a practical viewpoint, is to study the problem of forming networks where the topology need 
not be exactly the one which is ideally desirable, for example, a near-/c-star network instead 
of a precise /c-star. 
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APPENDIX 



Theorem [31 For a network with 7 < ^z^; — &2 + 7 (3^2 — ^3) < c < fei — 63 and 
(1 — 7) (62 — ^3) < Co < (1 — 7) 62, the resulting topology is a bipartite Turdn graph. 

Proof. We first derive conditions for pairwise stability of a bipartite Turan network, that is 
assuming that such a network is formed, what conditions are required so that there are no 
incentives for any two unconnected nodes to create a link between them and for any node 
to delete any of its links. Note that these conditions can be integrated in the later part of 
the proof within different scenarios that we consider. 

In what follows, pi is the size of the partition constituting the node taking its decision, p2 
is the size of the other partition and n = pi + p2 is the number of nodes in the network. 
We need to consider cases for some discretely small number of nodes owing to the nature 
of essential nodes, after which the analysis holds for arbitrarily large number of nodes. For 
brevity, we present the analysis for the base case and a generic case in each scenario, omitting 
presentation of discrete cases. 

No two nodes belonging to the same partition should create a link between them: 

That is, their utility should not increase by doing so. This is not applicable for n = 2. 
For n = 3, 

2(61 -c) < 61 -c + (1 -7)62 

^ c > 61 - 62 + 7^2 (16) 

For n > 4, 

(P2 + l){bi - c) + (pi - 2)62 < P2{bi - c) + (pi - 1)62 
^ c>bi-b2 
which is a weaker condition that Inequality f|T6|) . 

No node should delete its link with any node belonging to the other partition: 

That is, their utility should not increase by doing so. 
For n = 2, 

< 61 - c 

^ c < 61 (17) 

For n > 6, 

{P2 - l){bi - c) + {pi - 1)62 + &3 < P2{bi - c) + {pi - 1)62 

^ c < 61 - 63 (18) 

It can be shown that conditions for the discrete cases n = 3,4,5 are satisfied by Inequal- 
ity dUD. 
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In the process of formation of a bipartite Turan network, at most four different types of 
nodes exist at any point in time. 



I 


newly entered node 


II 


nodes connected to the newly entered node 


III 


nodes in the same partition as II, but not of Type II 


IV 


rest of the nodes 



The notation we use while deriving the sufficient conditions are as follows: 



k 


number of nodes of Type II 


n 


number of nodes in the network, including new node 


mi 


number of nodes of Types II and III put together 


m2 


number of nodes of Type IV 



For the newly entering node to enter the network: Its utility should be positive after 
doing so. Also, in case of even n, for the new node to be a part of the smaller partition, its 
first connection should be a node belonging to the larger partition. So for k = 0, we have 
For n > 2, 

Ti Ti 

h-c+\--l] ((l-7)62-co)+ L2-IJ (1-7)^3 >0 
It can be seen that the condition is the strongest when n = 2 whenever 

Co < (1 - l)h2 (19) 

The condition thus becomes 

c < 61 

which is satisfied by Inequality f lTS]) . 

The utility of a node in the larger partition, whenever applicable, should not 
decrease after accepting link from the new node: 

For n = 2, 

61 - c > 
c <hi 

For n > 5, 

in Tt Tt Tt T) ft 

TI Ti 

^c<h + 2^\--l'\b2+[--l\h 
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The conditions for these as well as the discrete cases n = 3, 4 are satisfied by Inequality ffTSj) . 

The new node should connect to a node in the larger partition, whenever appli- 
cable: One way to see this is by ensuring that this strategy strictly dominates connecting 
to a node in the smaller partition. This scenario arises for even values of > 4. 

6i - c + - l) ((1 - 7)&2 - Co) + - l) (1 - 7)&3 
>h-c+Q) ((1 - 7)62 - Co) + - 2) (1 - 7)&3 

^Co>(l-7)(&2-fe3) (20) 

An alternative condition would be such that the utility of a node in the smaller partition 
decreases if it accepts the link from the new node, thus forcing the latter to connect to 
a node in the other partition. But it can be seen that this condition is inconsistent with 
Inequality (ITS]) and so we use Inequality (|2U]) to meet our purpose. 

Type I node should prefer connecting to a Type III node, if any, than remaining 
in its current state: For k > 2, this scenario does not arise for n < Q. For n > 6, 

{k + l)(6i - c) + m2&2 + {mi - k - 1)63 > k{bi - c) + m2b2 + (mi - k)b3 

^ c < 61 - 63 (21) 

Now for k = 1, this scenario does not arise for n = 2, 3. 
For n > 4, 

2(61 - c) + 772262 + (mi - 2)63 > 61 - c + (1 - 7)771262 + (1 - 7)("^i - 1)63 

c < 61 - 63 + 7(771262 + {fni - 1)63) 

Note that as > 4, we have rrii > 2 and m2 > 1 and so the above condition is weaker that 
Inequality (12T]) . 

It is also necessary that utility of Type III node does not decrease on accepting link from 
Type I node. In fact, when the former gets a chance to move, we derive conditions so that 
it also volunteers to create a link with the later. 

The utility of Type III node should increase if it successfully creates a link with 
Type I node: When k = 1, the case does not arise for n = 2,3. 
For 72 > 6, 

(7772 + l)(6i - c) + (mi - 1)62 > m2(6i - c) + (mi - 1)62 + (1 - 7)63 
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^ c < bi - bs + 'yh 

The conditions obtained from discrete cases n = 4, 5 are weaker than this one. 
For k > 2, this case does not arise for n < 6. 
For n > 6, 

(ma + - c) + (mi - 1)62 > m2(6i - c) + (mi - 1)62 + b-s 

^ c < bi - 

The conditions for all cases are satisfied by Inequality (^T^. 

Type III node should not delete its link with Type IV node: This can be assured if 
this strategy is dominated by its strategy of forming a link with Type I node. This scenario 
does not arise for n = 2, 3. The conditions for the discrete cases n = 4, 5, 6 are weaker than 
that for n > 7. 
For n > 7, 

(m2 + l)(6i - c) + (mi - 1)62 > (^2 - l)(6i - c) + 63 + (mi - 1)62 + (1 - 7)^3 

^ c < 61 - 63 + 

For k > 2, the cases applicable are n > 6. The condition for discrete case n = 6 is weaker 
than the following condition. 

For n > 7, 

(m2 + l)(6i - c) + (mi - 1)62 > (^2 - l)(6i - c) + 63 + 63 + (mi - 1)62 

c < 61 - 63 

Hence, all conditions for this scenario are satisfied by Inequality (^T^. 

Type III node should prefer connecting to Type I node than to another Type 
III node: This does not arise for n < 6. When k = 1, 
For n > 6, 

(m2 + l)(6i - c) + (mi - 1)62 > (^2 + l)(6i - c) + (mi - 2% + (1 - 7)^3 

^ 62 > (1-7)^3 

which is always true. For k > 2, 
For n > 6, 

(m2 + l){bi - c) + (mi - 1)62 > {m2 + l)(&i - c) + (mi - 2)62 + &3 

<^ 62 > &3 
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which is always true. 

Type IV node should not delete its link with Type III node: That is, its utility 
should not increase by doing so. This does not arise for n < 4. 
For n > 7, 

(mi - l)(6i - c) + (ma - 1)62 + (1 - 7)^2 + 63 < rni{bi - c) + (ms - 1)62 + (1 - 7)^2 

^ c<bi-bs 

The conditions for discrete cases n = 4,5,6 are weaker than the above condition. For k >2, 
the new cases are n > 6, where the discrete case n = 6 result in conditions weaker than the 
following one. 
For n > 7, 

(mi - l){bi - c) + (m2 - 1)62 + 62 + ^3 < rni{bi - c) + (m2 - 1)62 + ^2 

^ c<bi-bs 

It can be seen that all conditions of this scenario are satisfied by Inequality f l2T|) . 

Type IV node should also not break its link with Type II node: That is, its utility 
should not increase by doing so. For k = 1, 
For n > 6, 

(mi - l)(6i - c) + (ms - 1)62 + (1 - l)h + (1 - 7)^3 < "^i(^i - c) + (ms - 1)62 + (1 - 7)^2 

^ c < 61 - 63 + (1 - 7)(^2 - bi) + 763 

The discrete cases n = 3, 4, 5 result in weaker conditions than this. For k > 2, 
For n > 6, 

(mi - l){bi - c) + (m2 - 1)62 + &2 + &3 < mi (61 - c) + (m2 - 1)62 + h 

c<bi-b-i 

The conditions are satisfied by Inequality ([21]). 

Type I node should not propose a link to a Type IV node: One way is to ensure 
that this strategy of Type I node is dominated by its strategy to propose a link to a Type 
III node. It can be seen that for k > 2 and n > 6, this translates to 

{k + l){bi - c) + m262 + (mi - A; - 1)63 > (A; + l)(6i - c) + (m2 - 1)62 + {mi - k)b2 
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^b2-h> {mi - k){h2 - h) 

which is not true for mi > k. 

So we look at the alternative condition that the utility of Type IV node decreases if it accepts 
the link from Type I node, and as Type I node computes this decrease in utility, it will not 
propose a link to Type IV node. First, we consider k = 1. The discrete case n = 4 gives the 
following condition. 

3(6i - c) + 2762 + 2762 < 2(61 - c) + (1 - 7)62 + 2762 + ih 

^ 61 - 62 + 7(3&2 - h) (22) 

The other discrete cases n = 3, 5 result in weaker conditions than the above. 
For n > 6, 

(mi + l)(6i - c) + (ma - 1)62 < "^1(^1 - c) + (m2 - 1)&2 + (1 - 7)^2 

c> 61 - 62 + 7^2 

which is a weaker condition than Inequality f l2^ . Now for A; > 2, n = 4, 5 correspond to 
pairwise stability conditions and cases n < 4 are not applicable. 
For n > 6, 

(mi + l)(&i - c) + (m2 - 1)62 < mi{bi - c) + (m2 - 1)62 + &2 

4^ bi-b2 

which is satisfied by Inequality f l22p . 

Type IV node should not propose a link to Type I node: This scenario is essentially 
equivalent to the previous one scenario of utility of Type IV node decreasing due to link 
with Type I node, with the equalities permitted. So these result in weaker and hence no 
additional conditions. 

Type III node should not propose a link to Type II node: One way is to ensure that 
for Type III node, connecting to Type II node is strictly dominated by connecting to Type 
I node. It can be seen that for k >2 and n > 6, this translates to 

(ms + l)(fei - c) + (mi - 2)62 + b2< (ms + l)(6i - c) + (mi - 1)62 

which gives > 0. So we need to use the alternative condition that the utility of Type II 
node decreases on accepting link from Type III node. For k = 1, 
For n = 4, 

3(61 - c) + 4762 < 2(&i - c) + (1 - 7)62 + 27^2 + ih 
<^ c> 61 - 62 + 7(3^2 - ^3) 
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which is same as Inequahty ( 122]) . 

For n > 5, 

(m2 + 2) (61 - c) + (mi - 2)62 + 27(m2 + 1)62 + 27(^1 - 2)63 
< (m2 + l){bi - c) + (mi - 1)&2 + 2'jm2b2 + 27(mi - 1)63 

<^ c> 61 - 62 + 27(62 - h) 

which is a weaker condition than Inequahty ( I22p . Now for > 2, the only new case is the 
following. 

For n > 6, 

(m2 + 2) (61 - c) + (mi - 2)62 < (?Ti2 + l)(6i - c) + (mi - 1)62 

^ bi-b2 

which is satisfied by Inequality f l22|) . 

Type II node should not propose a link with Type III node: This is essentially 
equivalent to the above scenario of utility of Type II node decreasing due to link with Type 
III node, with the equalities permitted. So these result in weaker and hence no additional 
conditions. 

No Type II node should delete link with Type IV node: First, we consider k = 1. 
For n > 7, 

(m2 - l)(6i - c) + (61 - c) + (mi - 1)62 + 63 + 27(^^2 - 1)^2 + 27(mi - 1)63 + 2764 
< m2(6i - c) + (61 - c) + (mi - 1)62 + 27m262 + 27(mi - 1)63 

^ c < 61 - 63 + 27(62 - 64) 

This as well as all discrete cases n < 7 are satisfied by Inequality (121 p . 

For k > 2, the cases of = 4, 5 correspond to pairwise stability condition that we have 

already considered, while cases n < 4 are not applicable. 

For n > 6, 

m2(6i - c) + (mi - 1)62 + 63 < {^2 + l)(6i - c) + (mi - 1)62 

c < 61 - 63 

which is satisfied by Inequality ( 12T|) . 

Two Type IV nodes should not create a mutual link: That is their utilities should 
not increase by doing so. When = 1, it is not applicable for n < 5. Also, the discrete case 
n = 5 results in the same condition as below. 
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For n > 6, 

(mi + l)(fei - c) + (m2 - 2)62 + (1 - 7)&2 < - c) + (ms - 1)62 + (1 - 7)^2 

c > 61 - 62 

For k > 2, n = 5 corresponds to pairwise stability condition. 
For n > 6, 

(mi + l)(6i - c) + (m2 - 2)62 + &2 < mi (61 - c) + (m2 - 1)62 + &2 

c > 61 - 62 

These are weaker conditions than Inequahty ( 122|) . 

No two Type II nodes should create a link between themselves: This only applies 
to /c > 2. Also n = 4, 5 result in pairwise stability condition. 
For n > 6, 

(m2 + 2) (61 - c) + (mi - 2)&2 < (^2 + l)(&i - c) + (mi - 1)62 

<^ c > 61 - 62 
which is a weaker condition than Inequality (122|) . 

Link between Type I node and Type II node should not be deleted: It is clear that 
it will not be deleted as such a link is just formed with no other changes in the network. 



Inequalities fl2T|) and f l22|) are stronger conditions than Inequalities f[T6|) . ffTTj) and (118 
Furthermore, for non-zero range of c, from Inequalities f l2T|) and f l22|) . we have 



The required sufficient conditions are obtained by combining Inequalities (HM . (HU^i . (1^ . 
(El and (El. ■ 
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